(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
26 July 2001 (26.07.2001) 




PCT 



(10) International Publication Number 

WO 01/53486 A1



(51) International Patent Classification 7 : C12N 15/12, 

C07K 14/47, 16/18, C12Q 1/68, G01N 33/574, C12N 
15/62 

(21) International Application Number: PCT/US00/03565

(22) International Filing Date: 11 February 2000 (11.02.2000)

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data:

PCT/US99/05028    8 March 1999 (08.03.1999)        US
60/123,972        11 March 1999 (11.03.1999)       US
60/133,459        11 May 1999 (11.05.1999)         US
PCT/US99/12252    2 June 1999 (02.06.1999)         US
60/140,650        22 June 1999 (22.06.1999)        US
60/140,653        22 June 1999 (22.06.1999)        US
60/144,758        20 July 1999 (20.07.1999)        US
60/145,698        26 July 1999 (26.07.1999)        US
60/146,222        28 July 1999 (28.07.1999)        US
60/149,395        17 August 1999 (17.08.1999)      US
60/151,689        31 August 1999 (31.08.1999)      US
PCT/US99/20111    1 September 1999 (01.09.1999)    US
PCT/US99/21090    15 September 1999 (15.09.1999)   US
PCT/US99/28313    30 November 1999 (30.11.1999)    US
PCT/US99/28301    1 December 1999 (01.12.1999)     US
PCT/US99/28634    1 December 1999 (01.12.1999)     US
PCT/US00/00219    5 January 2000 (05.01.2000)      US


(71) Applicant (for all designated States except US): GENENTECH,
INC. [US/US]; 1 DNA Way, South San Francisco,
CA 94080-4990 (US).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): ASHKENAZI,
Avi, J. [US/US]; 1456 Tarrytown Street, San Mateo, CA
94402 (US). GODDARD, Audrey [CA/US]; 110 Congo
Street, San Francisco, CA 94131 (US). GODOWSKI,
Paul, J. [US/US]; 2627 Easton Drive, Burlingame, CA
94010 (US). GURNEY, Austin, L. [US/US]; 1 Debbie

[Continued on next page] 



(54) Title: COMPOSITIONS AND METHODS FOR THE TREATMENT OF TUMOR

[Figure: amino acid sequence with annotated features; the sequence itself is not legible in this copy.]

Transmembrane domain: Amino acids 51-70
N-glycosylation site: Amino acids 224-228
cAMP- and cGMP-dependent protein kinase phosphorylation sites: Amino acids 46-50; 118-122
N-myristoylation sites: Amino acids 50-56; 129-135; 341-347; 357-363
Fibrinogen beta and gamma chains C-terminal domain signature: Amino acids 396-409

(57) Abstract: The invention concerns compositions and methods for the diagnosis and treatment of neoplastic cell growth and
proliferation in mammals, including humans. The invention is based upon the identification of genes that are amplified in the genome
of tumor cells. Such gene amplification is expected to be associated with the overexpression of the gene product as compared to
normal cells of the same tissue type and contribute to tumorigenesis. Accordingly, the proteins encoded by the amplified genes are
believed to be useful targets for the diagnosis and/or treatment (including prevention) of certain cancers and may act as predictors
of the prognosis of tumor treatment. The present invention is directed to novel polypeptides and to nucleic acid molecules encoding
those polypeptides. Also provided herein are vectors and host cells comprising those nucleic acid sequences, chimeric polypeptide
molecules comprising the polypeptides of the present invention fused to heterologous polypeptide sequences, antibodies which bind
to the polypeptides of the present invention and to methods for producing the polypeptides of the present invention.






Lane, Belmont, CA 94002 (US). HILLAN, Kenneth, J.
[GB/US]; 64 Seward Street, San Francisco, CA 94114
(US). MARSTERS, Scot, A. [US/US]; 990 Cherry Street,
San Carlos, CA 94070 (US). PAN, James [CA/US]; 2705
Coronet Boulevard, Belmont, CA 94002 (US). PITTI,
Robert, M. [US/US]; 1110 Liberty Street, El Cerrito, CA
94530 (US). ROY, Margaret, Ann [GB/US]; 2960 Webster
Street #4, San Francisco, CA 94123 (US). SMITH,
Victoria [AU/US]; 19 Dwight Road, Burlingame, CA
94010 (US). STONE, Donna, M. [US/US]; 684 Sierra
Point Road, Brisbane, CA 94005 (US). WATANABE,
Colin, K. [US/US]; 128 Corliss Drive, Moraga, CA 94556
(US). WOOD, William, I. [US/US]; 35 Southdown Court,
Hillsborough, CA 94010 (US).

(74) Agents: BARNES, Elizabeth, M. et al.; Genentech, Inc.,
1 DNA Way, South San Francisco, CA 94080-4990 (US). 

(81) Designated States (national): AE, AL, AM, AT, AU, AZ,
BA, BB, BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK,
DM, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL,
IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU,
LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT,
RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA,
UG, US, UZ, VN, YU, ZA, ZW.

(84) Designated States (regional): ARIPO patent (GH, GM,
KE, LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent
(AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU,
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM,
GA, GN, GW, ML, MR, NE, SN, TD, TG).

Published: 

— with international search report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 






of Washington, Seattle, Washington). One or more of the ESTs used in the assembly was derived from a library 
constructed from tissue obtained from the parotid (salivary) gland of a human with parotid cancer. The consensus 
sequence obtained therefrom is herein designated as DNA56019. 

In light of an observed sequence homology between the DNA56019 sequence and Incyte EST clone no. 
1327836, Incyte EST clone no. 1327836 was purchased and the cDNA insert was obtained and sequenced. The 
sequence of this cDNA insert is shown in Figure 27 (SEQ ID NO:27) and is herein designated as DNA64884-1527.

The entire coding sequence of DNA64884-1527 is included in Figure 27 (SEQ ID NO:27). Clone 
DNA64884-1527 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 79-81 and ending at the stop codon at nucleotide positions 391-393 (Figure 27). The predicted 
polypeptide precursor is 104 amino acids long (Figure 28; SEQ ID NO:28). The full-length PRO1245 protein
shown in Figure 28 has an estimated molecular weight of about 10,100 daltons and a pI of about 8.76. Analysis
of the full-length PRO1245 sequence shown in Figure 28 (SEQ ID NO:28) evidences the presence of a variety of
important polypeptide domains, wherein the locations given for those important polypeptide domains are
approximate as described above. Analysis of the full-length PRO1245 sequence shown in Figure 28 evidences the
presence of the following: a signal peptide from about amino acid 1 to about amino acid 18; N-myristoylation sites
from about amino acid 8 to about amino acid 14, from about amino acid 65 to about amino acid 71, from about
amino acid 74 to about amino acid 80, and from about amino acid 88 to about amino acid 94; and a prokaryotic
membrane lipoprotein lipid attachment site from about amino acid 5 to about amino acid 16. Clone DNA64884-1527
has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203155.
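
The stated polypeptide length can be cross-checked against the reported open reading frame coordinates. The following is a minimal sketch, not part of the original disclosure; the function name `orf_protein_length` is hypothetical, and it assumes 1-based inclusive nucleotide positions and that the stop codon contributes no amino acid.

```python
def orf_protein_length(start_first_nt: int, stop_last_nt: int) -> int:
    """Return the number of amino acids encoded by an open reading frame.

    start_first_nt: 1-based position of the first nucleotide of the start codon.
    stop_last_nt:   1-based position of the last nucleotide of the stop codon.
    (Hypothetical helper for illustration only.)
    """
    span = stop_last_nt - start_first_nt + 1  # inclusive nucleotide count
    if span <= 0 or span % 3 != 0:
        raise ValueError("ORF span must be a positive multiple of 3")
    codons = span // 3
    return codons - 1  # the stop codon encodes no amino acid


# DNA64884-1527: initiation site at positions 79-81, stop codon at 391-393.
print(orf_protein_length(79, 393))  # 104
```

Under these assumptions, the span 79-393 covers 315 nucleotides (105 codons), which agrees with the 104-amino-acid precursor reported above.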
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 28 (SEQ ID NO:28), evidenced some homology
between the PRO1245 amino acid sequence and the following Dayhoff sequences: SYA_THETH, GEN11167,
MTV044.4, AB01115U, RLAJ2750.3, SNEUPTRA.l, S63624, C28391, A37907, and S14064.



EXAMPLE 17 

Isolation of cDNA Clones Encoding Human PRO1759

DNA76531-1701 was identified by applying the proprietary signal sequence finding algorithm described
in Example 2 above. Use of the above described signal sequence algorithm allowed identification of an EST cluster
sequence from the LIFESEQ® database, Incyte Pharmaceuticals, Palo Alto, CA, designated DNA10571. This EST
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public
EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo
Alto, CA) to identify existing homologies. The homology search was performed using the computer program
BLAST or BLAST2 (Altschul et al., Methods in Enzymology, 266:460-480 (1996)). Those comparisons resulting
in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington). One or more of the ESTs used in the assembly was derived from pooled eosinophils of
allergic asthmatic patients. The consensus sequence obtained therefrom is herein designated as DNA57313.
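
The selection criterion described above (a BLAST score of 70 or greater, or 90 in some cases, restricted to sequences that do not encode known proteins) can be illustrated with a short filter. This is only a sketch under stated assumptions: the `Hit` record, its field names, and the example identifiers are hypothetical and do not come from BLAST, phrap, or the original disclosure.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    est_id: str          # hypothetical EST identifier
    score: int           # BLAST score against the query cluster sequence
    known_protein: bool  # True if the EST encodes a known protein


def select_for_assembly(hits, min_score=70):
    """Keep hits at or above the cutoff that do not encode known proteins."""
    return [h for h in hits if h.score >= min_score and not h.known_protein]


# Illustrative, made-up hits; not data from the patent.
hits = [
    Hit("EST_A", 95, False),
    Hit("EST_B", 72, True),
    Hit("EST_C", 65, False),
    Hit("EST_D", 70, False),
]
print([h.est_id for h in select_for_assembly(hits)])                # ['EST_A', 'EST_D']
print([h.est_id for h in select_for_assembly(hits, min_score=90)])  # ['EST_A']
```

The hits retained by such a filter would then be passed to an assembler to build the consensus sequence; that step is not modeled here.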

In light of an observed sequence homology between the DNA57313 sequence and Incyte EST 2434255, 






the clone including this Incyte EST 2434255 was purchased and the cDNA insert was obtained and sequenced. The
sequence of this cDNA insert is shown in Figure 29 (SEQ ID NO:29) and is herein designated as DNA76531-1701.

The entire coding sequence of DNA76531-1701 is included in Figure 29 (SEQ ID NO:29). Clone
DNA76531-1701 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 125-127 and ending at the stop codon at nucleotide positions 1475-1477 (Figure 29). The predicted
polypeptide precursor is 450 amino acids long (Figure 30; SEQ ID NO:30). The full-length PRO1759 protein
shown in Figure 30 has an estimated molecular weight of about 49,765 daltons and a pI of about 8.14. Analysis
of the full-length PRO1759 sequence shown in Figure 30 (SEQ ID NO:30) evidences the presence of a variety of
important polypeptide domains, wherein the locations given for those important polypeptide domains are
approximate as described above. Analysis of the full-length PRO1759 sequence shown in Figure 30 evidences the
presence of the following: a signal peptide from about amino acid 1 to about amino acid 18; transmembrane
domains from about amino acid 41 to about amino acid 55, from about amino acid 75 to about amino acid 94, from
about amino acid 127 to about amino acid 143, from about amino acid 191 to about amino acid 213, from about
amino acid 249 to about amino acid 270, from about amino acid 278 to about amino acid 299, from about amino
acid 314 to about amino acid 330, from about amino acid 343 to about amino acid 359, from about amino acid 379
to about amino acid 394, and from about amino acid 410 to about amino acid 430; a cAMP- and cGMP-dependent
protein kinase phosphorylation site from about amino acid 104 to about amino acid 108; N-myristoylation sites from
about amino acid 11 to about amino acid 17, from about amino acid 18 to about amino acid 24, from about amino
acid 84 to about amino acid 90, from about amino acid 92 to about amino acid 98, from about amino acid 137 to
about amino acid 143, from about amino acid 138 to about amino acid 144, from about amino acid 238 to about
amino acid 244, from about amino acid 253 to about amino acid 259, from about amino acid 278 to about amino
acid 284, and from about amino acid 282 to about amino acid 288; an amidation site from about amino acid 102
to about amino acid 106; and a prokaryotic membrane lipoprotein lipid attachment site from about amino acid 6 to
about amino acid 17. Clone DNA76531-1701 has been deposited with ATCC on November 17, 1998 and is
assigned ATCC deposit no. 203465.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 30 (SEQ ID NO:30), evidenced sequence identity
between the PRO1759 amino acid sequence and the following Dayhoff sequences: OPDE_PSEAE, TH11_TRYBB,
S67684, RGT2_YEAST, S68362, ATSUGTRPR_1, P_W17836 (Patent application WO9715668-A2), F69587,
A48076, and A45611.

EXAMPLE 18 
Isolation of cDNA Clones Encoding Human PRO5775
DNA96869-2673 was identified by applying the proprietary signal sequence finding algorithm described 
in Example 2 above. Use of the above described signal sequence algorithm allowed identification of an EST cluster 
sequence from the LIFESEQ® database, Incyte Pharmaceuticals, Palo Alto, CA, designated herein as CLU86443.
This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which 
included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
[...] HCT116, SKCO1, and LS174T.

Because amplification of DNA58801-1052 occurs in various tumors, it is highly probable to play a
significant role in tumor formation or growth. As a result, antagonists (e.g., antibodies) directed against the protein
encoded by DNA58801-1052 (PRO779) would be expected to have utility in cancer therapy.

PRO1185 (DNA62881-1515):

The ΔCt values for DNA62881-1515 in a variety of tumors are reported in Table 7A. A ΔCt of >1 was
typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table
7A indicates that significant amplification of nucleic acid DNA62881-1515 encoding PRO1185 occurred: (1) in
primary lung tumors: LT3, LT30 and LT26; and (2) in primary colon tumor CT2.
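
The relationship between the ΔCt threshold and gene copy number can be sketched as follows. This is a minimal illustration, not part of the original disclosure: the function names are hypothetical, and it assumes ideal PCR efficiency, in which each cycle doubles the product so that one ΔCt unit corresponds to a 2-fold difference in copy number.

```python
def fold_amplification(delta_ct: float) -> float:
    """Approximate fold change in gene copy for a given delta-Ct.

    Assumes ideal PCR efficiency (product doubles every cycle).
    """
    return 2.0 ** delta_ct


def is_amplified(delta_ct: float, threshold: float = 1.0) -> bool:
    """Score a sample as amplified when delta-Ct exceeds the threshold."""
    return delta_ct > threshold


print(fold_amplification(1.0))  # 2.0, i.e. a doubling of gene copy
print(is_amplified(1.0))        # False: the criterion is strictly greater than 1
print(is_amplified(1.5))        # True
```

The strict inequality mirrors the ">1" wording of the threshold; whether borderline values were scored differently in practice is not stated in the text.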
Because amplification of DNA62881-1515 occurs in various tumors, it is highly probable to play a
significant role in tumor formation or growth. As a result, antagonists (e.g., antibodies) directed against the protein
encoded by DNA62881-1515 (PRO1185) would be expected to have utility in cancer therapy.

PRO1245 (DNA64884-1527):

The ΔCt values for DNA64884-1527 in a variety of tumors are reported in Table 7A. A ΔCt of >1 was
typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table
7A indicates that significant amplification of nucleic acid DNA64884-1527 encoding PRO1245 occurred: (1) in
primary lung tumors: LT13, LT15 and LT16; (2) in lung tumor cell line H522; and (3) in primary colon tumor
CT15.

Because amplification of DNA64884-1527 occurs in various tumors, it is highly probable to play a
significant role in tumor formation or growth. As a result, antagonists (e.g., antibodies) directed against the protein
encoded by DNA64884-1527 (PRO1245) would be expected to have utility in cancer therapy.

PRO1759 (DNA76531-1701):

The ΔCt values for DNA76531-1701 in a variety of tumors are reported in Table 7B. A ΔCt of >1 was
typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table
7B indicates that significant amplification of nucleic acid DNA76531-1701 encoding PRO1759 occurred: (1) in
primary lung tumors: HF-000840 and HF-001296; and (2) in primary colon tumor center HF-000795.

Because amplification of DNA76531-1701 occurs in various tumors, it is highly probable to play a
significant role in tumor formation or growth. As a result, antagonists (e.g., antibodies) directed against the protein
encoded by DNA76531-1701 (PRO1759) would be expected to have utility in cancer therapy.

PRO5775 (DNA96869-2673):

The ΔCt values for DNA96869-2673 in a variety of tumors are reported in Table 7B. A ΔCt of >1 was
typically used as the threshold value for amplification scoring, as this represents a doubling of gene copy. Table
7B indicates that significant amplification of nucleic acid DNA96869-2673 encoding PRO5775 occurred: (1) in
primary lung tumors: HF-000631, HF-000641, HF-000643, HF-000840, HF-000842, HF-001293, HF-001294, HF-




